Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 27 |
| Number of observations | 296 |
| Missing cells | 415 |
| Missing cells (%) | 5.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 75.3 KiB |
| Average record size in memory | 260.4 B |
Variable types
| Type | Count |
|---|---|
| NUM | 15 |
| BOOL | 8 |
| CAT | 3 |
| DATE | 1 |
Reproduction
| Item | Value |
|---|---|
| Analysis started | 2020-05-05 17:15:33.933246 |
| Analysis finished | 2020-05-05 17:16:17.190420 |
| Version | pandas-profiling v2.5.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Warnings
| Warning | Type |
|---|---|
| month is highly correlated with quarter and 1 other field | High Correlation |
| quarter is highly correlated with month and 1 other field | High Correlation |
| weekofyear is highly correlated with quarter and 1 other field | High Correlation |
| meanwd_udsprevisionempresa is highly correlated with meanwd_udsventa | High Correlation |
| meanwd_udsventa is highly correlated with meanwd_udsprevisionempresa | High Correlation |
| udsstock has 98 (33.1%) missing values | Missing |
| udsventa has 63 (21.3%) missing values | Missing |
| udsprevisionempresa has 81 (27.4%) missing values | Missing |
| roll4wd_udsventa has 50 (16.9%) missing values | Missing |
| meanwd_udsventa has 42 (14.2%) missing values | Missing |
| roll4wd_udsstock has 17 (5.7%) missing values | Missing |
| roll4wd_udsprevisionempresa has 64 (21.6%) missing values | Missing |
| weekday has 42 (14.2%) zeros | Zeros |
| sin_weekday has 42 (14.2%) zeros | Zeros |
df_index
Real number (ℝ≥0)
| Distinct count | 296 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10650.0 |
|---|---|
| Minimum | 30 |
| Maximum | 21270 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 30 |
|---|---|
| 5-th percentile | 1092 |
| Q1 | 5340 |
| median | 10650 |
| Q3 | 15960 |
| 95-th percentile | 20208 |
| Maximum | 21270 |
| Range | 21240 |
| Interquartile range (IQR) | 10620 |
Descriptive statistics
| Standard deviation | 6162.628011 |
|---|---|
| Coefficient of variation (CV) | 0.5786505174 |
| Kurtosis | -1.2 |
| Mean | 10650 |
| Median Absolute Deviation (MAD) | 5328 |
| Skewness | 0 |
| Sum | 3152400 |
| Variance | 37977984 |
| Value | Count | Frequency (%) | |
| 11334 | 1 | 0.3% | |
| 10398 | 1 | 0.3% | |
| 10686 | 1 | 0.3% | |
| 2694 | 1 | 0.3% | |
| 18822 | 1 | 0.3% | |
| 16878 | 1 | 0.3% | |
| 9750 | 1 | 0.3% | |
| 9894 | 1 | 0.3% | |
| 10038 | 1 | 0.3% | |
| 2190 | 1 | 0.3% | |
| Other values (286) | 286 | 96.6% |
| Value | Count | Frequency (%) | |
| 30 | 1 | 0.3% | |
| 102 | 1 | 0.3% | |
| 174 | 1 | 0.3% | |
| 246 | 1 | 0.3% | |
| 318 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 21270 | 1 | 0.3% | |
| 21198 | 1 | 0.3% | |
| 21126 | 1 | 0.3% | |
| 21054 | 1 | 0.3% | |
| 20982 | 1 | 0.3% |
fecha
Date
| Distinct count | 296 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| Minimum | 2019-06-05 00:00:00 |
|---|---|
| Maximum | 2020-03-26 00:00:00 |
producto
Categorical
| Distinct count | 1 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| Value | Count | Frequency (%) | |
| 42 | 296 | 100.0% |
Length
| Max length | 2 |
|---|---|
| Mean length | 2 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 2 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 2 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 2 | 100.0% |
udsstock
Real number (ℝ≥0)
| Distinct count | 107 |
|---|---|
| Unique (%) | 54.0% |
| Missing | 98 |
| Missing (%) | 33.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 873.520202020202 |
|---|---|
| Minimum | 116.0 |
| Maximum | 2209.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 116 |
|---|---|
| 5-th percentile | 324.65 |
| Q1 | 603.25 |
| median | 809 |
| Q3 | 1092.5 |
| 95-th percentile | 1564.55 |
| Maximum | 2209 |
| Range | 2093 |
| Interquartile range (IQR) | 489.25 |
Descriptive statistics
| Standard deviation | 391.7146313 |
|---|---|
| Coefficient of variation (CV) | 0.4484322519 |
| Kurtosis | 0.6721510347 |
| Mean | 873.520202 |
| Median Absolute Deviation (MAD) | 305.5072952 |
| Skewness | 0.7127143265 |
| Sum | 172957 |
| Variance | 153440.3524 |
| Value | Count | Frequency (%) | |
| 736 | 9 | 3.0% | |
| 765 | 7 | 2.4% | |
| 649 | 4 | 1.4% | |
| 475 | 4 | 1.4% | |
| 794 | 4 | 1.4% | |
| 610 | 4 | 1.4% | |
| 862 | 4 | 1.4% | |
| 571 | 4 | 1.4% | |
| 1017 | 4 | 1.4% | |
| 1085 | 4 | 1.4% | |
| Other values (97) | 150 | 50.7% | |
| (Missing) | 98 | 33.1% |
| Value | Count | Frequency (%) | |
| 116 | 1 | 0.3% | |
| 135 | 2 | 0.7% | |
| 145 | 1 | 0.3% | |
| 174 | 1 | 0.3% | |
| 213 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 2209 | 1 | 0.3% | |
| 1967 | 3 | 1.0% | |
| 1889 | 1 | 0.3% | |
| 1841 | 1 | 0.3% | |
| 1763 | 2 | 0.7% |
udsventa
Real number (ℝ≥0)
| Distinct count | 96 |
|---|---|
| Unique (%) | 41.2% |
| Missing | 63 |
| Missing (%) | 21.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 538.7467811158798 |
|---|---|
| Minimum | 88.0 |
| Maximum | 1830.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 88 |
|---|---|
| 5-th percentile | 252 |
| Q1 | 391 |
| median | 523 |
| Q3 | 656 |
| 95-th percentile | 853.8 |
| Maximum | 1830 |
| Range | 1742 |
| Interquartile range (IQR) | 265 |
Descriptive statistics
| Standard deviation | 230.7791531 |
|---|---|
| Coefficient of variation (CV) | 0.4283629363 |
| Kurtosis | 8.000659612 |
| Mean | 538.7467811 |
| Median Absolute Deviation (MAD) | 164.541583 |
| Skewness | 1.882204417 |
| Sum | 125528 |
| Variance | 53259.0175 |
| Value | Count | Frequency (%) | |
| 546 | 8 | 2.7% | |
| 420 | 6 | 2.0% | |
| 487 | 6 | 2.0% | |
| 523 | 5 | 1.7% | |
| 280 | 5 | 1.7% | |
| 605 | 5 | 1.7% | |
| 701 | 5 | 1.7% | |
| 597 | 5 | 1.7% | |
| 479 | 5 | 1.7% | |
| 575 | 4 | 1.4% | |
| Other values (86) | 179 | 60.5% | |
| (Missing) | 63 | 21.3% |
| Value | Count | Frequency (%) | |
| 88 | 1 | 0.3% | |
| 154 | 1 | 0.3% | |
| 177 | 2 | 0.7% | |
| 184 | 1 | 0.3% | |
| 199 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 1830 | 1 | 0.3% | |
| 1815 | 1 | 0.3% | |
| 1424 | 1 | 0.3% | |
| 1298 | 1 | 0.3% | |
| 1107 | 1 | 0.3% |
udsprevisionempresa
Real number (ℝ≥0)
| Distinct count | 202 |
|---|---|
| Unique (%) | 94.0% |
| Missing | 81 |
| Missing (%) | 27.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2763.753488372093 |
|---|---|
| Minimum | 85.0 |
| Maximum | 15421.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 85 |
|---|---|
| 5-th percentile | 355.3 |
| Q1 | 1224.5 |
| median | 2280 |
| Q3 | 3680 |
| 95-th percentile | 6412.6 |
| Maximum | 15421 |
| Range | 15336 |
| Interquartile range (IQR) | 2455.5 |
Descriptive statistics
| Standard deviation | 2253.619448 |
|---|---|
| Coefficient of variation (CV) | 0.8154198475 |
| Kurtosis | 7.605839069 |
| Mean | 2763.753488 |
| Median Absolute Deviation (MAD) | 1615.708772 |
| Skewness | 2.119297 |
| Sum | 594207 |
| Variance | 5078800.617 |
| Value | Count | Frequency (%) | |
| 3285 | 3 | 1.0% | |
| 960 | 2 | 0.7% | |
| 88 | 2 | 0.7% | |
| 4752 | 2 | 0.7% | |
| 369 | 2 | 0.7% | |
| 206 | 2 | 0.7% | |
| 1160 | 2 | 0.7% | |
| 5502 | 2 | 0.7% | |
| 1974 | 2 | 0.7% | |
| 2386 | 2 | 0.7% | |
| Other values (192) | 194 | 65.5% | |
| (Missing) | 81 | 27.4% |
| Value | Count | Frequency (%) | |
| 85 | 1 | 0.3% | |
| 88 | 2 | 0.7% | |
| 134 | 1 | 0.3% | |
| 147 | 1 | 0.3% | |
| 172 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 15421 | 1 | 0.3% | |
| 13811 | 1 | 0.3% | |
| 12574 | 1 | 0.3% | |
| 8348 | 1 | 0.3% | |
| 8230 | 1 | 0.3% |
promo
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| Value | Count | Frequency (%) | |
| 0 | 280 | 94.6% | |
| 1 | 16 | 5.4% |
festivo
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| Value | Count | Frequency (%) | |
| 0 | 288 | 97.3% | |
| 1 | 8 | 2.7% |
weekday
Real number (ℝ≥0)
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9966216216216215 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 42 |
| Zeros (%) | 14.2% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.997453142 |
|---|---|
| Coefficient of variation (CV) | 0.6665683542 |
| Kurtosis | -1.241520413 |
| Mean | 2.996621622 |
| Median Absolute Deviation (MAD) | 1.706560446 |
| Skewness | 0.004680305814 |
| Sum | 887 |
| Variance | 3.989819056 |
| Value | Count | Frequency (%) | |
| 3 | 43 | 14.5% | |
| 2 | 43 | 14.5% | |
| 6 | 42 | 14.2% | |
| 5 | 42 | 14.2% | |
| 4 | 42 | 14.2% | |
| 1 | 42 | 14.2% | |
| 0 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 0 | 42 | 14.2% | |
| 1 | 42 | 14.2% | |
| 2 | 43 | 14.5% | |
| 3 | 43 | 14.5% | |
| 4 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 6 | 42 | 14.2% | |
| 5 | 42 | 14.2% | |
| 4 | 42 | 14.2% | |
| 3 | 43 | 14.5% | |
| 2 | 43 | 14.5% |
quarter
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| Value | Count | Frequency (%) | |
| 4 | 92 | 31.1% | |
| 3 | 92 | 31.1% | |
| 1 | 86 | 29.1% | |
| 2 | 26 | 8.8% |
Length
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 4 | 100.0% |
month
Real number (ℝ≥0)
| Distinct count | 10 |
|---|---|
| Unique (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.993243243243243 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 8 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 3.667533456 |
|---|---|
| Coefficient of variation (CV) | 0.5244395666 |
| Kurtosis | -1.215710455 |
| Mean | 6.993243243 |
| Median Absolute Deviation (MAD) | 3.109751644 |
| Skewness | -0.3478227975 |
| Sum | 2070 |
| Variance | 13.45080165 |
| Value | Count | Frequency (%) | |
| 12 | 31 | 10.5% | |
| 10 | 31 | 10.5% | |
| 8 | 31 | 10.5% | |
| 7 | 31 | 10.5% | |
| 1 | 31 | 10.5% | |
| 11 | 30 | 10.1% | |
| 9 | 30 | 10.1% | |
| 2 | 29 | 9.8% | |
| 6 | 26 | 8.8% | |
| 3 | 26 | 8.8% |
| Value | Count | Frequency (%) | |
| 1 | 31 | 10.5% | |
| 2 | 29 | 9.8% | |
| 3 | 26 | 8.8% | |
| 6 | 26 | 8.8% | |
| 7 | 31 | 10.5% |
| Value | Count | Frequency (%) | |
| 12 | 31 | 10.5% | |
| 11 | 30 | 10.1% | |
| 10 | 31 | 10.5% | |
| 9 | 30 | 10.1% | |
| 8 | 31 | 10.5% |
weekofyear
Real number (ℝ≥0)
| Distinct count | 43 |
|---|---|
| Unique (%) | 14.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.469594594594593 |
|---|---|
| Minimum | 1 |
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 11 |
| median | 31 |
| Q3 | 42 |
| 95-th percentile | 50 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 15.97664889 |
|---|---|
| Coefficient of variation (CV) | 0.561182873 |
| Kurtosis | -1.229228509 |
| Mean | 28.46959459 |
| Median Absolute Deviation (MAD) | 13.65613587 |
| Skewness | -0.3266565044 |
| Sum | 8427 |
| Variance | 255.2533097 |
| Value | Count | Frequency (%) | |
| 52 | 7 | 2.4% | |
| 51 | 7 | 2.4% | |
| 29 | 7 | 2.4% | |
| 28 | 7 | 2.4% | |
| 27 | 7 | 2.4% | |
| 26 | 7 | 2.4% | |
| 25 | 7 | 2.4% | |
| 24 | 7 | 2.4% | |
| 12 | 7 | 2.4% | |
| 11 | 7 | 2.4% | |
| Other values (33) | 226 | 76.4% |
| Value | Count | Frequency (%) | |
| 1 | 7 | 2.4% | |
| 2 | 7 | 2.4% | |
| 3 | 7 | 2.4% | |
| 4 | 7 | 2.4% | |
| 5 | 7 | 2.4% |
| Value | Count | Frequency (%) | |
| 52 | 7 | 2.4% | |
| 51 | 7 | 2.4% | |
| 50 | 7 | 2.4% | |
| 49 | 7 | 2.4% | |
| 48 | 7 | 2.4% |
working_day
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 424.0 B |
| Value | Count | Frequency (%) | |
| True | 246 | 83.1% | |
| False | 50 | 16.9% |
sin_weekday
Real number (ℝ)
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.004759498821957385 |
|---|---|
| Minimum | -0.9749279121818236 |
| Maximum | 0.9749279121818236 |
| Zeros | 42 |
| Zeros (%) | 14.2% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | -0.9749279122 |
|---|---|
| 5-th percentile | -0.9749279122 |
| Q1 | -0.7818314825 |
| median | 0 |
| Q3 | 0.7818314825 |
| 95-th percentile | 0.9749279122 |
| Maximum | 0.9749279122 |
| Range | 1.949855824 |
| Interquartile range (IQR) | 1.563662965 |
Descriptive statistics
| Standard deviation | 0.7086201304 |
|---|---|
| Coefficient of variation (CV) | 148.8854514 |
| Kurtosis | -1.50521649 |
| Mean | 0.004759498822 |
| Median Absolute Deviation (MAD) | 0.6270716718 |
| Skewness | -0.0106157593 |
| Sum | 1.408811651 |
| Variance | 0.5021424891 |
| Value | Count | Frequency (%) | |
| 0.4338837391 | 43 | 14.5% | |
| 0.9749279122 | 43 | 14.5% | |
| -0.4338837391 | 42 | 14.2% | |
| -0.9749279122 | 42 | 14.2% | |
| -0.7818314825 | 42 | 14.2% | |
| 0.7818314825 | 42 | 14.2% | |
| 0 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| -0.9749279122 | 42 | 14.2% | |
| -0.7818314825 | 42 | 14.2% | |
| -0.4338837391 | 42 | 14.2% | |
| 0 | 42 | 14.2% | |
| 0.4338837391 | 43 | 14.5% |
| Value | Count | Frequency (%) | |
| 0.9749279122 | 43 | 14.5% | |
| 0.7818314825 | 42 | 14.2% | |
| 0.4338837391 | 43 | 14.5% | |
| 0 | 42 | 14.2% | |
| -0.4338837391 | 42 | 14.2% |
cos_weekday
Real number (ℝ)
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.0037955736549281846 |
|---|---|
| Minimum | -0.9009688679024191 |
| Maximum | 1.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | -0.9009688679 |
|---|---|
| 5-th percentile | -0.9009688679 |
| Q1 | -0.9009688679 |
| median | -0.222520934 |
| Q3 | 0.6234898019 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1.900968868 |
| Interquartile range (IQR) | 1.52445867 |
Descriptive statistics
| Standard deviation | 0.7079619739 |
|---|---|
| Coefficient of variation (CV) | -186.5230498 |
| Kurtosis | -1.503349059 |
| Mean | -0.003795573655 |
| Median Absolute Deviation (MAD) | 0.6408877408 |
| Skewness | 0.009053080122 |
| Sum | -1.123489802 |
| Variance | 0.5012101565 |
| Value | Count | Frequency (%) | |
| -0.222520934 | 43 | 14.5% | |
| -0.9009688679 | 43 | 14.5% | |
| -0.222520934 | 42 | 14.2% | |
| -0.9009688679 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% | |
| 1 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| -0.9009688679 | 42 | 14.2% | |
| -0.9009688679 | 43 | 14.5% | |
| -0.222520934 | 42 | 14.2% | |
| -0.222520934 | 43 | 14.5% | |
| 0.6234898019 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 1 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% | |
| -0.222520934 | 43 | 14.5% | |
| -0.222520934 | 42 | 14.2% |
is_august
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| Value | Count | Frequency (%) | |
| 0 | 265 | 89.5% | |
| 1 | 31 | 10.5% |
spring
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| Value | Count | Frequency (%) | |
| 0 | 291 | 98.3% | |
| 1 | 5 | 1.7% |
summer
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| Value | Count | Frequency (%) | |
| 0 | 188 | 63.5% | |
| 1 | 108 | 36.5% |
autumn
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| Value | Count | Frequency (%) | |
| 0 | 206 | 69.6% | |
| 1 | 90 | 30.4% |
winter
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| Value | Count | Frequency (%) | |
| 0 | 200 | 67.6% | |
| 1 | 96 | 32.4% |
stockMissingType
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| Value | Count | Frequency (%) | |
| 0 | 198 | 66.9% | |
| 2 | 87 | 29.4% | |
| 1 | 11 | 3.7% |
Length
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 3 | 75.0% | |
| Other_Punctuation | 1 | 25.0% |
| Value | Count | Frequency (%) | |
| Common | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 4 | 100.0% |
roll4wd_udsventa
Real number (ℝ≥0)
| Distinct count | 239 |
|---|---|
| Unique (%) | 97.2% |
| Missing | 50 |
| Missing (%) | 16.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 537.2568960511034 |
|---|---|
| Minimum | 268.625 |
| Maximum | 1387.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 268.625 |
|---|---|
| 5-th percentile | 314.28125 |
| Q1 | 411.5535714 |
| median | 535.7589286 |
| Q3 | 623.3125 |
| 95-th percentile | 766.71875 |
| Maximum | 1387 |
| Range | 1118.375 |
| Interquartile range (IQR) | 211.7589286 |
Descriptive statistics
| Standard deviation | 163.5755164 |
|---|---|
| Coefficient of variation (CV) | 0.3044642472 |
| Kurtosis | 3.514143979 |
| Mean | 537.2568961 |
| Median Absolute Deviation (MAD) | 123.2683072 |
| Skewness | 1.154097903 |
| Sum | 132165.1964 |
| Variance | 26756.94956 |
| Value | Count | Frequency (%) | |
| 414.5 | 2 | 0.7% | |
| 583.5 | 2 | 0.7% | |
| 555 | 2 | 0.7% | |
| 608.375 | 2 | 0.7% | |
| 682.125 | 2 | 0.7% | |
| 491.25 | 2 | 0.7% | |
| 347.375 | 2 | 0.7% | |
| 329.75 | 1 | 0.3% | |
| 462.5 | 1 | 0.3% | |
| 554.7142857 | 1 | 0.3% | |
| Other values (229) | 229 | 77.4% | |
| (Missing) | 50 | 16.9% |
| Value | Count | Frequency (%) | |
| 268.625 | 1 | 0.3% | |
| 269.125 | 1 | 0.3% | |
| 276.5 | 1 | 0.3% | |
| 282.625 | 1 | 0.3% | |
| 287 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 1387 | 1 | 0.3% | |
| 1225.714286 | 1 | 0.3% | |
| 1026.428571 | 1 | 0.3% | |
| 1023.625 | 1 | 0.3% | |
| 924 | 1 | 0.3% |
meanwd_udsventa
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 42 |
| Missing (%) | 14.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 539.0817874183757 |
|---|---|
| Minimum | 425.43243243243245 |
| Maximum | 721.421052631579 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 425.4324324 |
|---|---|
| 5-th percentile | 425.4324324 |
| Q1 | 427.575 |
| median | 546.7105263 |
| Q3 | 627.075 |
| 95-th percentile | 721.4210526 |
| Maximum | 721.4210526 |
| Range | 295.9886202 |
| Interquartile range (IQR) | 199.5 |
Descriptive statistics
| Standard deviation | 107.504357 |
|---|---|
| Coefficient of variation (CV) | 0.1994212372 |
| Kurtosis | -1.13015028 |
| Mean | 539.0817874 |
| Median Absolute Deviation (MAD) | 92.67711065 |
| Skewness | 0.5035262163 |
| Sum | 136926.774 |
| Variance | 11557.18677 |
| Value | Count | Frequency (%) | |
| 627.075 | 43 | 14.5% | |
| 546.7105263 | 43 | 14.5% | |
| 427.575 | 42 | 14.2% | |
| 721.4210526 | 42 | 14.2% | |
| 425.4324324 | 42 | 14.2% | |
| 484 | 42 | 14.2% | |
| (Missing) | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 425.4324324 | 42 | 14.2% | |
| 427.575 | 42 | 14.2% | |
| 484 | 42 | 14.2% | |
| 546.7105263 | 43 | 14.5% | |
| 627.075 | 43 | 14.5% |
| Value | Count | Frequency (%) | |
| 721.4210526 | 42 | 14.2% | |
| 627.075 | 43 | 14.5% | |
| 546.7105263 | 43 | 14.5% | |
| 484 | 42 | 14.2% | |
| 427.575 | 42 | 14.2% |
roll4wd_udsstock
Real number (ℝ≥0)
| Distinct count | 239 |
|---|---|
| Unique (%) | 85.7% |
| Missing | 17 |
| Missing (%) | 5.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 854.271556579621 |
|---|---|
| Minimum | 234.75 |
| Maximum | 2209.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 234.75 |
|---|---|
| 5-th percentile | 407 |
| Q1 | 643.1071429 |
| median | 815.8 |
| Q3 | 1039.571429 |
| 95-th percentile | 1364.7 |
| Maximum | 2209 |
| Range | 1974.25 |
| Interquartile range (IQR) | 396.4642857 |
Descriptive statistics
| Standard deviation | 311.2570991 |
|---|---|
| Coefficient of variation (CV) | 0.3643538131 |
| Kurtosis | 1.378515585 |
| Mean | 854.2715566 |
| Median Absolute Deviation (MAD) | 242.720032 |
| Skewness | 0.8119969193 |
| Sum | 238341.7643 |
| Variance | 96880.98173 |
| Value | Count | Frequency (%) | |
| 407 | 6 | 2.0% | |
| 504 | 5 | 1.7% | |
| 765 | 4 | 1.4% | |
| 1085 | 3 | 1.0% | |
| 590.25 | 2 | 0.7% | |
| 790.4 | 2 | 0.7% | |
| 416 | 2 | 0.7% | |
| 769.4285714 | 2 | 0.7% | |
| 1240 | 2 | 0.7% | |
| 678 | 2 | 0.7% | |
| Other values (229) | 249 | 84.1% | |
| (Missing) | 17 | 5.7% |
| Value | Count | Frequency (%) | |
| 234.75 | 1 | 0.3% | |
| 283.25 | 1 | 0.3% | |
| 324.5 | 1 | 0.3% | |
| 329 | 1 | 0.3% | |
| 339 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 2209 | 1 | 0.3% | |
| 2029.75 | 1 | 0.3% | |
| 1850.5 | 1 | 0.3% | |
| 1676 | 1 | 0.3% | |
| 1671.25 | 1 | 0.3% |
meanwd_udsstock
Real number (ℝ≥0)
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 866.7360099309544 |
|---|---|
| Minimum | 634.952380952381 |
| Maximum | 1121.2222222222222 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 634.952381 |
|---|---|
| 5-th percentile | 634.952381 |
| Q1 | 686.04 |
| median | 836.6 |
| Q3 | 1026.206897 |
| 95-th percentile | 1121.222222 |
| Maximum | 1121.222222 |
| Range | 486.2698413 |
| Interquartile range (IQR) | 340.1668966 |
Descriptive statistics
| Standard deviation | 171.2294205 |
|---|---|
| Coefficient of variation (CV) | 0.1975566015 |
| Kurtosis | -1.473227951 |
| Mean | 866.7360099 |
| Median Absolute Deviation (MAD) | 155.459912 |
| Skewness | 0.09458312511 |
| Sum | 256553.8589 |
| Variance | 29319.51444 |
| Value | Count | Frequency (%) | |
| 1026.206897 | 43 | 14.5% | |
| 836.6 | 43 | 14.5% | |
| 1121.222222 | 42 | 14.2% | |
| 634.952381 | 42 | 14.2% | |
| 762.2580645 | 42 | 14.2% | |
| 996.7931034 | 42 | 14.2% | |
| 686.04 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 634.952381 | 42 | 14.2% | |
| 686.04 | 42 | 14.2% | |
| 762.2580645 | 42 | 14.2% | |
| 836.6 | 43 | 14.5% | |
| 996.7931034 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 1121.222222 | 42 | 14.2% | |
| 1026.206897 | 43 | 14.5% | |
| 996.7931034 | 42 | 14.2% | |
| 836.6 | 43 | 14.5% | |
| 762.2580645 | 42 | 14.2% |
roll4wd_udsprevisionempresa
Real number (ℝ≥0)
| Distinct count | 229 |
|---|---|
| Unique (%) | 98.7% |
| Missing | 64 |
| Missing (%) | 21.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2830.7518626847295 |
|---|---|
| Minimum | 85.0 |
| Maximum | 15421.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 85 |
|---|---|
| 5-th percentile | 211.0142857 |
| Q1 | 1309.736607 |
| median | 2308.1875 |
| Q3 | 3775.2625 |
| 95-th percentile | 6756.7625 |
| Maximum | 15421 |
| Range | 15336 |
| Interquartile range (IQR) | 2465.525893 |
Descriptive statistics
| Standard deviation | 2357.303113 |
|---|---|
| Coefficient of variation (CV) | 0.8327480569 |
| Kurtosis | 7.09494101 |
| Mean | 2830.751863 |
| Median Absolute Deviation (MAD) | 1646.196307 |
| Skewness | 2.18795865 |
| Sum | 656734.4321 |
| Variance | 5556877.968 |
| Value | Count | Frequency (%) | |
| 88 | 2 | 0.7% | |
| 147 | 2 | 0.7% | |
| 2579.75 | 2 | 0.7% | |
| 2828 | 1 | 0.3% | |
| 2031.8 | 1 | 0.3% | |
| 2380.125 | 1 | 0.3% | |
| 1892.25 | 1 | 0.3% | |
| 2877 | 1 | 0.3% | |
| 2322.625 | 1 | 0.3% | |
| 1991.375 | 1 | 0.3% | |
| Other values (219) | 219 | 74.0% | |
| (Missing) | 64 | 21.6% |
| Value | Count | Frequency (%) | |
| 85 | 1 | 0.3% | |
| 88 | 2 | 0.7% | |
| 104.8571429 | 1 | 0.3% | |
| 147 | 2 | 0.7% | |
| 161.75 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 15421 | 1 | 0.3% | |
| 13811 | 1 | 0.3% | |
| 12574 | 1 | 0.3% | |
| 12204.25 | 1 | 0.3% | |
| 11310.5 | 1 | 0.3% |
meanwd_udsprevisionempresa
Real number (ℝ≥0)
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2267.130010577379 |
|---|---|
| Minimum | 147.0 |
| Maximum | 4634.894736842105 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 147 |
|---|---|
| 5-th percentile | 147 |
| Q1 | 1069.916667 |
| median | 2023.605263 |
| Q3 | 3600.410256 |
| 95-th percentile | 4634.894737 |
| Maximum | 4634.894737 |
| Range | 4487.894737 |
| Interquartile range (IQR) | 2530.49359 |
Descriptive statistics
| Standard deviation | 1405.572389 |
|---|---|
| Coefficient of variation (CV) | 0.6199787318 |
| Kurtosis | -0.9070440848 |
| Mean | 2267.130011 |
| Median Absolute Deviation (MAD) | 1169.726349 |
| Skewness | 0.2230081453 |
| Sum | 671070.4831 |
| Variance | 1975633.74 |
| Value | Count | Frequency (%) | |
| 2647.184211 | 43 | 14.5% | |
| 3600.410256 | 43 | 14.5% | |
| 2023.605263 | 42 | 14.2% | |
| 1069.916667 | 42 | 14.2% | |
| 1706.105263 | 42 | 14.2% | |
| 4634.894737 | 42 | 14.2% | |
| 147 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 147 | 42 | 14.2% | |
| 1069.916667 | 42 | 14.2% | |
| 1706.105263 | 42 | 14.2% | |
| 2023.605263 | 42 | 14.2% | |
| 2647.184211 | 43 | 14.5% |
| Value | Count | Frequency (%) | |
| 4634.894737 | 42 | 14.2% | |
| 3600.410256 | 43 | 14.5% | |
| 2647.184211 | 43 | 14.5% | |
| 2023.605263 | 42 | 14.2% | |
| 1706.105263 | 42 | 14.2% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, implying that for a linear relationship the steepness of the slope does not affect r; only its sign does. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
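The calculation described above can be sketched in a few lines of plain Python. This is an illustrative sketch, not the code the report generator runs; the function name `pearson_r` and the toy data are assumptions made for the example.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must be equally long, with at least 2 points")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n factors of the covariance and of the two variances cancel out,
    # so plain sums of (cross-)deviations are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / sqrt(ss_x * ss_y)


# Hypothetical toy data: y is an exact linear function of x.
x = [1, 2, 3, 4, 5]
y = [2, 4, 6, 8, 10]
r_linear = pearson_r(x, y)                          # close to +1
r_flipped = pearson_r(x, [-v for v in y])           # close to -1
r_rescaled = pearson_r([10 * v + 3 for v in x], y)  # location/scale change of x
```

Shifting and rescaling `x` leaves the result unchanged, which is the invariance property mentioned in the text; negating one variable flips the sign.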
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
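As a rough sketch of that recipe, the snippet below ranks both variables and then applies the covariance-over-standard-deviations formula to the ranks. The helper names (`average_ranks`, `spearman_rho`) and the toy data are hypothetical, and tied values are assumed to share the average of their ranks, which is a common convention rather than something the report states.

```python
from math import sqrt


def average_ranks(values):
    """1-based ranks; tied values share the average of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current group of equal values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Pearson-style correlation computed on the ranks of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    ss_x = sum((a - mx) ** 2 for a in rx)
    ss_y = sum((b - my) ** 2 for b in ry)
    return cov / sqrt(ss_x * ss_y)


# Hypothetical toy data: y grows monotonically but nonlinearly with x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)  # close to +1 despite the nonlinearity
```

Because only the ordering matters, the cubic relationship above still yields a coefficient of essentially +1, whereas Pearson's r on the raw values would be lower.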
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
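The pair-counting definition above corresponds to the variant usually called tau-a, and a direct (quadratic-time) sketch of it follows. The function name `kendall_tau_a` and the toy data are assumptions for illustration; library implementations often differ, for instance SciPy's `kendalltau` defaults to the tie-adjusted tau-b, so results can diverge when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, ties counted as neither."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must be equally long, with at least 2 points")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables move in the same direction
        elif sign < 0:
            discordant += 1   # the variables move in opposite directions
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Hypothetical toy data: one swapped pair out of six possible pairs.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)  # 5 concordant, 1 discordant, 6 pairs
```

With five concordant pairs and one discordant pair, the example evaluates to (5 - 1) / 6, or about 0.67.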
First rows
| | df_index | fecha | producto | udsstock | udsventa | udsprevisionempresa | promo | festivo | weekday | quarter | month | weekofyear | working_day | sin_weekday | cos_weekday | is_august | spring | summer | autumn | winter | stockMissingType | roll4wd_udsventa | meanwd_udsventa | roll4wd_udsstock | meanwd_udsstock | roll4wd_udsprevisionempresa | meanwd_udsprevisionempresa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 30 | 2019-06-05 | 42 | 765.0 | 523.0 | 12574.0 | 0.0 | 0.0 | 2 | 2 | 6 | 23 | True | 0.974928 | -0.222521 | 0 | 0 | 1 | 0 | 0 | 0.0 | 523.00 | 546.710526 | 765.00 | 836.600000 | 12574.00 | 2647.184211 |
| 1 | 102 | 2019-06-06 | 42 | NaN | 752.0 | 15421.0 | 0.0 | 0.0 | 3 | 2 | 6 | 23 | True | 0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 2.0 | 752.00 | 627.075000 | NaN | 1026.206897 | 15421.00 | 3600.410256 |
| 2 | 174 | 2019-06-07 | 42 | NaN | 383.0 | 13811.0 | 0.0 | 0.0 | 4 | 2 | 6 | 23 | True | -0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 2.0 | 383.00 | 721.421053 | NaN | 996.793103 | 13811.00 | 4634.894737 |
| 3 | 246 | 2019-06-08 | 42 | NaN | 553.0 | 5558.0 | 0.0 | 0.0 | 5 | 2 | 6 | 23 | True | -0.974928 | -0.222521 | 0 | 0 | 1 | 0 | 0 | 2.0 | 553.00 | 425.432432 | NaN | 1121.222222 | 5558.00 | 1069.916667 |
| 4 | 318 | 2019-06-09 | 42 | NaN | NaN | NaN | 0.0 | 0.0 | 6 | 2 | 6 | 23 | False | -0.781831 | 0.623490 | 0 | 0 | 1 | 0 | 0 | 2.0 | NaN | NaN | NaN | 634.952381 | NaN | 147.000000 |
| 5 | 390 | 2019-06-10 | 42 | NaN | 420.0 | 4591.0 | 0.0 | 0.0 | 0 | 2 | 6 | 24 | True | 0.000000 | 1.000000 | 0 | 0 | 1 | 0 | 0 | 2.0 | 420.00 | 484.000000 | NaN | 686.040000 | 4591.00 | 2023.605263 |
| 6 | 462 | 2019-06-11 | 42 | 649.0 | 287.0 | 3404.0 | 0.0 | 0.0 | 1 | 2 | 6 | 24 | True | 0.781831 | 0.623490 | 0 | 0 | 1 | 0 | 0 | 0.0 | 287.00 | 427.575000 | 649.00 | 762.258065 | 3404.00 | 1706.105263 |
| 7 | 534 | 2019-06-12 | 42 | 1298.0 | 560.0 | 1797.0 | 0.0 | 0.0 | 2 | 2 | 6 | 24 | True | 0.974928 | -0.222521 | 0 | 0 | 1 | 0 | 0 | 0.0 | 532.25 | 546.710526 | 898.25 | 836.600000 | 9879.75 | 2647.184211 |
| 8 | 606 | 2019-06-13 | 42 | 1153.0 | 686.0 | 2554.0 | 0.0 | 0.0 | 3 | 2 | 6 | 24 | True | 0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 0.0 | 735.50 | 627.075000 | 1153.00 | 1026.206897 | 12204.25 | 3600.410256 |
| 9 | 678 | 2019-06-14 | 42 | 1037.0 | 1830.0 | 3809.0 | 1.0 | 0.0 | 4 | 2 | 6 | 24 | True | -0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 0.0 | 744.75 | 721.421053 | 1037.00 | 996.793103 | 11310.50 | 4634.894737 |
Last rows
| | df_index | fecha | producto | udsstock | udsventa | udsprevisionempresa | promo | festivo | weekday | quarter | month | weekofyear | working_day | sin_weekday | cos_weekday | is_august | spring | summer | autumn | winter | stockMissingType | roll4wd_udsventa | meanwd_udsventa | roll4wd_udsstock | meanwd_udsstock | roll4wd_udsprevisionempresa | meanwd_udsprevisionempresa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 286 | 20622 | 2020-03-17 | 42 | NaN | 1107.0 | 1254.0 | 0.0 | 0.0 | 1 | 1 | 3 | 12 | True | 0.781831 | 0.623490 | 0 | 0 | 0 | 0 | 1 | 2.0 | 395.375000 | 427.575000 | 819.285714 | 762.258065 | 2155.500 | 1706.105263 |
| 287 | 20694 | 2020-03-18 | 42 | NaN | 501.0 | 2825.0 | 0.0 | 0.0 | 2 | 1 | 3 | 12 | True | 0.974928 | -0.222521 | 0 | 0 | 0 | 0 | 1 | 2.0 | 576.750000 | 546.710526 | 1055.500000 | 836.600000 | 3514.625 | 2647.184211 |
| 288 | 20766 | 2020-03-19 | 42 | NaN | 405.0 | 2940.0 | 0.0 | 0.0 | 3 | 1 | 3 | 12 | True | 0.433884 | -0.900969 | 0 | 0 | 0 | 0 | 1 | 2.0 | 468.000000 | 627.075000 | 678.000000 | 1026.206897 | 4432.000 | 3600.410256 |
| 289 | 20838 | 2020-03-20 | 42 | 1172.0 | NaN | 3062.0 | 0.0 | 0.0 | 4 | 1 | 3 | 12 | True | -0.433884 | -0.900969 | 0 | 0 | 0 | 0 | 1 | 0.0 | 908.142857 | 721.421053 | 871.500000 | 996.793103 | 6285.375 | 4634.894737 |
| 290 | 20910 | 2020-03-21 | 42 | 1967.0 | NaN | NaN | 0.0 | 0.0 | 5 | 1 | 3 | 12 | True | -0.974928 | -0.222521 | 0 | 0 | 0 | 0 | 1 | 0.0 | 896.714286 | 425.432432 | 976.400000 | 1121.222222 | NaN | 1069.916667 |
| 291 | 20982 | 2020-03-22 | 42 | 1967.0 | NaN | NaN | 0.0 | 0.0 | 6 | 1 | 3 | 12 | False | -0.781831 | 0.623490 | 0 | 1 | 0 | 0 | 1 | 0.0 | NaN | NaN | 1240.000000 | 634.952381 | NaN | 147.000000 |
| 292 | 21054 | 2020-03-23 | 42 | 1967.0 | NaN | 1160.0 | 0.0 | 0.0 | 0 | 1 | 3 | 13 | True | 0.000000 | 1.000000 | 0 | 1 | 0 | 0 | 1 | 0.0 | 338.000000 | 484.000000 | 1240.000000 | 686.040000 | 2034.500 | 2023.605263 |
| 293 | 21126 | 2020-03-24 | 42 | 1498.0 | NaN | 329.0 | 0.0 | 0.0 | 1 | 1 | 3 | 13 | True | 0.781831 | 0.623490 | 0 | 1 | 0 | 0 | 1 | 0.0 | 644.857143 | 427.575000 | 1057.200000 | 762.258065 | 1618.500 | 1706.105263 |
| 294 | 21198 | 2020-03-25 | 42 | 1498.0 | NaN | 1673.0 | 0.0 | 0.0 | 2 | 1 | 3 | 13 | True | 0.974928 | -0.222521 | 0 | 1 | 0 | 0 | 1 | 0.0 | 546.428571 | 546.710526 | 1378.500000 | 836.600000 | 3054.875 | 2647.184211 |
| 295 | 21270 | 2020-03-26 | 42 | 1498.0 | NaN | 673.0 | 0.0 | 0.0 | 3 | 1 | 3 | 13 | True | 0.433884 | -0.900969 | 0 | 1 | 0 | 0 | 1 | 0.0 | 378.857143 | 627.075000 | 1498.000000 | 1026.206897 | 3388.625 | 3600.410256 |